Fisher information and thermodynamics' 1st. law 
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It was recently shown, starting from first principles, that ther- 
modynamics' first law (TFL) can be microscopically obtained 
without need of invoking the adiabatic theorem (AT) [Phys- 
ica A, 356, 167 (2005)]. We show here that a TFL can also 
be found for Fisher's information measure, following a similar 
procedure. Further, it is proved that enforcing the Fisher- 
first law requirements in a process in which the probability 
distribution is infinitesimally varied is equivalent to minimiz- 
ing Fisher's information measure subject to appropriate con- 
straints. 

Pacs: 05.30.-d, 05.30.Jp 



I. INTRODUCTION 

The study and revision of some of the principles of sta- 
tistical mechanics is still an active field (see, for instance, 
among many valuable works, [1-7]). In particular, it was 
shown in [8] that the entire Legendre-transform (LT) of 
thermodynamics can be microscopically reproduced us- 
ing Fisher's information measure in place of Boltzmcnn's 
entropy. This abstract LT constitues an essential ingre- 
dient that permits one to build up a "statistical mechan- 
ics" . Fisher information / allows for such a construction. 
The desired concavity property of/, demonstrated in [8], 
further illustrates on its utility as a "generator" of an un- 
derlying microscopic explanation for thermodynamics. 
In another, totally disctint vein, it was also recently 
shown, starting from first principles and using the point 
of view of information theory, that thermodynamics' first 
law (TFL) can be microscopically obtained without 

• the need of invoking the adiabatic theorem (AT) 
with 

• its awkward dependence on hypothetical external, 
infinitely slowly varying parameters, and also, 

• without needing some other usually invoked con- 
straints and arguments [9]. 

We will show here that a "TFL" can also be found for 
Fisher's information measure rather than Shannon's, fol- 
lowing a similar procedure. As a corollary result we 
demonstrate that, given this Fisher TFL, using a pro- 
cess in which a sought after unknown probability distri- 
bution / is infinitesimally varied is equivalent to seeking 



/ by minimizing Fisher's information measure subject to 
appropriate constraints . 



II. PRELIMINARIES 

One of the most important extant information measures 
was advanced by R.A. Fisher in the twenties -for details 
and discussions we refer to (for instance) [10-12]-. Fisher 
information (FI) arises as a measure of the expected error 
in the measurement of a parameter 9 in a situation gov- 
erned by a family of probability densities p(x,9), where 
x is, of course, a random variable, and is the parame- 
ter characterizing the alluded mono-parameter family of 
probability densities [10]. It reads 



iGeneral = / dx p(x, 9) 



<91np 
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(1) 



A particular Fl-case of great importance [10] is that of 
translation families [10,13], that is, distribution functions 
of the form 



p(x,9) = p(x 



(2) 



In this particular case 9 is a shift parameter. In other 
words, two probability densities with different values of 
9 have the same shape, but are displaced with respect to 
each other. It is clear that 



and, consequently, 



dx p(x, 9) 



dp 
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dp 



dp 

dx ' 



dx p{x, 9) 



dp 

dx 



(3) 



(4) 



From now on, and in order to simplify the notation, the 
9— dependence will not be explicitated. Thus, the classi- 
cal Fisher information associated with translations of a, 
for simplicity's sake, one-dimensional observable x with 
probability distribution f(x) becomes [14] 



I = J dxf(x) 



ainf(x) 



dx 



(•5) 



and the associated Cramer-Rao inequality [10,14] satis- 
fies the Cramer-Rao relation 



1 



Ax > r 1 (6) 
where Ax is the variance of the stochastic variable x [14] 



Ax-/dx,(x)^-(/dx,(x)x) 2 , 



(7) 



and represents the mean-square error associated to the 
above referred to measurement. Notice that, unlike the 
case for Shannon's measure, / is NOT a function of fix) 
alone, but also of its derivatives. 

We will derive a relationship involving the internal en- 
ergy, Fisher information and temperature, a generalized 
pressure and the known a priori input constraints of mea- 
surement data. Having constructed this relationship, it 
will be most simply interpreted as the desired TFL. 



III. A RELATION FOR THE DIFFERENTIAL OF 
FISHER'S MEASURE 

A. Notation 

We start presenting now our new results. We proceed 
first of all to derive a relationship for dl. For this we 
need to deal with the internal energy 

U = (Ox) = (H(x)) = Tr[fH] = J dx f(x)H(x), (8) 

where we have used the abbreviation 

Trf = J dxf(x), (9) 

and consider variations Sf of the distribution function, 
i.e., a process of the type 



f{x) - f{x) +Sf{x), 



(10) 



where normalization-preservation entails that, during the 
process, 



Tr[6f] = STr[f] = 0. 



(11) 



We also have, of course, for other physical quantities 

Oiix) 

Tr{8f(x)O t (x)]=d(O t (x)). (12) 



B. The Fisher TFL 

We are going now to rely heavily on the results of Ref. 
[8]. It was shown there that, if we are discussing a ther- 
modynamical setting involving M extensive variables Oi , 
we can recover the formal, Legendre-structure of thermo- 
dynamics if we minimize the Fisher information measure 



I subject to appropriate constraints. These are given by 
the supossedly a-priori known values 

Constraints in the minimization process are : 
(Oiix)) = a h (i = 1,...,M)) plus 
Tr[f] = 1, normalization of /. (13) 

It turns then out that the extremized measure I is a 
function only of the extensive variables 

J = /«<?! (*)>, <O 2 (x)),...,<O M (a0)), (14) 

and that, further [8], 

dl 



d{O k ix)) 



(15) 



where Afe, (ft = 1, ... , M) are M Lagrange multipliers 
that arise in the course of the minimization procedure. 
From (14) and (15) it is now clear that 



M 



dl 



i.e., 



ti d(Oiix)) 



M 



d(Oiix)), 



dI = J2\id(Oiix)). 



(16) 



(17) 



We explicitly separate now the energy term, and define 
/3 = A i= i = l/T. Here T will be a "Fisher-temperature" 
(FT), whose existence was conjectured in [10,11] and 
proved in [8]. Its properties have been extensively dis- 
cussed in Ref. [15]. The FT behaves as l/T ort hodox, so 
that an ordinary thermometer, conveniently recalibrated, 
can measure it. Accordingly, a term of the form Tdl is 
the Fisher-equivalent of a "heat" increment dQF [15] and 
we are in a position to write 



M 



dl = [3dU + J2 \d(Oiix)). 



(18) 



i=2 



Introducing, finally, the "generalized pressures" 

Pj =TXj, (j=2,...,M), (19) 
(18) leads us to 

M 



dU = TdI -J2 Pi d(Oiix)), 



(20) 



i=2 



which is a "Fisher's thermodynamics" first-law: note that 
the variations of the extensive quantities are not indepen- 
dent one of another. 

If we call the differential of work (dW), effected at tem- 
perature T, in the form of 



2 



M 



dW = J2Pid(Oi(x)), 



(21) 



i=2 



and, as stated above, recognize that Tdl plays the role of 
a heat-change dQp, as expected (but not explicitly dis- 
cussed there) from the philosophy espoused in [8] . Thus, 
we have here shown that work is represented by changes 
in expectation values. These constituted part of our prior 
knowledge. If a posteriori we encounter changes, this en- 
tails that work has been performed, on or by the system. 



IV. TFL AS A GENERATOR OF THE 
PROBABILITY DISTRIBUTION F 

As a corollary to the above results we infer what would be 
the case if we turn things around and take the TFL (16) 
as the given, and want to find the unknown probability 
distribution /. By considering a change from / to f + Sf, 
with Sf arbitrary, but constrained so that the TFL (16) is 
obeyed in effecting the probabilities change, we obtain, as 
a result of that constrained change, the very same / that 
is obtained by the method of MaxEnt that maximizes 
Fisher information subject to constraints [8]. 
We now demonstrate this result formaly, in order to make 
all the conditions and steps explicit, as a THEOREM: 
HYPOTHESIS: If we constrain the process (10)-(11) forc- 
ing it to comply with (20), 

THESIS: this results in a probability distribution that mini- 
mizes I subject to the constraints 



(Oi(x))= ai , (i = l,...,M), 



(22) 



where (3 and the A's are the quantities appearing in (20). 
We are thus demanding simultaneous fulfillment of (10)- 
(11), which leads to the two requirements 



where the Oj are real numbers supposedly known a-priori. 
PROOF 

The process (10)-(11) generates infinitesimal changes in 
respectively I, U, and the (Oi(x)}, (i = 2, . . . , M), and 
these infinitesimal changes are constrained in the fashion 
(20). Thus, for the change Sf prescribed by (10)-(11) we 
have (as shown in [8]), starting from the J— definition (5), 

dI = JdxK[f] Sf, 
with a functional — derivative K = <51/<5f of the form 

K + (23) 

while the changes in the (Oi(x)), (i = 2, . . . , M) are gov- 
erned by (12), this is, 



S f [(O m (x))]=S f Tr[O m (x)f(x)} 
= Tr[O m (x)Sf], (Vm). 



(24) 



Of course, / is not yet known. We wish to "extract" / 
from the 1st Law-relation (20). We introduce now, for 
future reference, the function W(x) defined as 



M 



(25) 



(1) / dx [K(x) - (iH{x) - YZ2 ^ (Oi(x))]5f = 
= j dx [K(x) - W(x)]6f = 
(2) / dx Sf = 0. 



(26) 



The second one is fullfilled by any arbitrary variation 
Sf (x) such that any possible area under the pertinent 
curve above the x— axis is compensated by another simi- 
lar one under that axis. Now, a special type of variation 
among these allow us to obtain an important result from 
the first relation in (26): consider for the associated inte- 
gral a Sf(x) that consists of two equal rectangles of van- 
ishing widths and height C(x), centered at points x\ and 
X2, respectively, such that one of them points upwards 
and the other downwards (and thus the contributions of 
their areas is zero). Since the two points x\, X2 may 
be located anywhere along the x— axis, we are forced to 
conclude that the rectangle's height, given by 



C(x) = [K(x) — VF(x)] = constant = —a, 



(27) 



which is identical to Eq. (13) of [8]. It is therein shown 
that, (i) setting the probability distribution / equal to 
the square of some amplitude ip 



f(x) = ^{xf, 
(ii) defining the new variable 

91n^(x) 



v(x) 



(28) 



(29) 



(iii) one obtains a Ricatti equation from which one imme- 
diately gets for the amplitude a Scroedinger-like equation 
[8] 



1 M 

-^X! Ai Oi(x)ip(x) = atp(x). 



(30) 



Now, (30) yields simultaneously: 



• the probability distribution that minimizes I sub- 
ject to the constraints posed by (i) normalization, 
with Lagrange multiplier a and (ii) (22), with La- 
grange multipliers Ai, as shown in [8] 

• as demonstrated here, the probability distribution 
that guarantees that the process (10)-(11) is ef- 
fected in such a manner that the first law require- 
ments (20) are satisfied, with a equal to minus the 
rectangle's height in (27) and the Aj's equal to the 
ratio of the generalized pressures to Ai = (i, i.e., 
Eq. (19). 

The theorem is proved. 
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V. DISCUSSION AND CONCLUSIONS 

Two new results have been advanced here: 

• We have shown that, within Fisher information the- 
ory context, one can derive thermodynamics first 
law for the Fisher Thermodynamics without ap- 
peal to the adiabatic theorem, or to any explicit 
dependence on hypothetical external parameters. 
Thus, we avoid the need to add, to the theoret- 
ical description, putative infinitely slowly varying 
external parameters so as to obtain the first law. 
We find work is represented by changes in expec- 
tation values. These constituted part of our prior 
knowledge. If a posteriori we encounter changes, 
this entails that work has been performed, on or by 
the system. 

• As a corollary we demonstrated that, given the 
Fisher TFL, using a process in which a sought af- 
ter unknown probability distribution / is infinites- 
immally varied is equivelant to seeking / by mini- 
mizing Fisher's information measure subject to ap- 
propriate constraints. Plastino and Curado have 
shown that similar results would obtain for a large 
class of additional entropies, namely, those that 
depend only on / [16], unlike the Fisher measure 
treated here, that in addition depends on deriva- 
tives of / as well. All these entropies with the nec- 
essary concavity property, and Legendre-transform 
structure of thermodynamics, allow the construc- 
tion of a "statistical mechanics" [16]. 
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